Overview

Dataset statistics

Number of variables30
Number of observations6132
Missing cells0
Missing cells (%)0.0%
Duplicate rows3
Duplicate rows (%)< 0.1%
Total size in memory1.4 MiB
Average record size in memory240.0 B

Variable types

Categorical8
Boolean1
Numeric21

Alerts

bathrooms has constant value "1"Constant
Dataset has 3 (< 0.1%) duplicate rowsDuplicates
host_neighbourhood has a high cardinality: 312 distinct valuesHigh cardinality
neighbourhood has a high cardinality: 187 distinct valuesHigh cardinality
city has a high cardinality: 262 distinct valuesHigh cardinality
property_type has a high cardinality: 63 distinct valuesHigh cardinality
accommodates is highly overall correlated with property_type and 3 other fieldsHigh correlation
bedrooms is highly overall correlated with property_type and 2 other fieldsHigh correlation
beds is highly overall correlated with property_type and 2 other fieldsHigh correlation
guests_included is highly overall correlated with extra_peopleHigh correlation
extra_people is highly overall correlated with guests_includedHigh correlation
availability_30 is highly overall correlated with availability_60 and 2 other fieldsHigh correlation
availability_60 is highly overall correlated with availability_30 and 2 other fieldsHigh correlation
availability_90 is highly overall correlated with availability_30 and 2 other fieldsHigh correlation
availability_365 is highly overall correlated with availability_30 and 2 other fieldsHigh correlation
review_scores_rating is highly overall correlated with review_scores_accuracy and 5 other fieldsHigh correlation
review_scores_accuracy is highly overall correlated with review_scores_rating and 5 other fieldsHigh correlation
review_scores_cleanliness is highly overall correlated with review_scores_rating and 5 other fieldsHigh correlation
review_scores_checkin is highly overall correlated with review_scores_rating and 5 other fieldsHigh correlation
review_scores_communication is highly overall correlated with review_scores_rating and 5 other fieldsHigh correlation
review_scores_location is highly overall correlated with review_scores_rating and 5 other fieldsHigh correlation
review_scores_value is highly overall correlated with review_scores_rating and 5 other fieldsHigh correlation
property_type is highly overall correlated with room_type and 4 other fieldsHigh correlation
room_type is highly overall correlated with property_type and 1 other fieldsHigh correlation
security_deposit is highly overall correlated with cleaning_feeHigh correlation
cleaning_fee is highly overall correlated with security_depositHigh correlation
number_of_reviews is highly overall correlated with property_typeHigh correlation
extra_people has 3095 (50.5%) zerosZeros
availability_30 has 2239 (36.5%) zerosZeros
availability_60 has 1721 (28.1%) zerosZeros
availability_90 has 1351 (22.0%) zerosZeros
availability_365 has 744 (12.1%) zerosZeros
number_of_reviews has 1145 (18.7%) zerosZeros

Reproduction

Analysis started2022-12-12 01:05:29.287614
Analysis finished2022-12-12 01:07:01.182250
Duration1 minute and 31.89 seconds
Software versionpandas-profiling vv3.5.0
Download configurationconfig.json

Variables

Distinct312
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size48.0 KiB
Sunnyvale
558 
Santa Clara
450 
Cambridge
444 
Palo Alto
398 
Alum Rock
 
310
Other values (307)
3972 

Length

Max length31
Median length29
Mean length10.53441
Min length3

Characters and Unicode

Total characters64597
Distinct characters58
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique122 ?
Unique (%)2.0%

Sample

1st rowMenlo Park
2nd rowMenlo Park
3rd rowMenlo Park
4th rowPalo Alto
5th rowSanta Clara

Common Values

ValueCountFrequency (%)
Sunnyvale 558
 
9.1%
Santa Clara 450
 
7.3%
Cambridge 444
 
7.2%
Palo Alto 398
 
6.5%
Alum Rock 310
 
5.1%
West Valley 271
 
4.4%
San Jose 259
 
4.2%
Mountain View 184
 
3.0%
Central San Jose 165
 
2.7%
Berryessa 161
 
2.6%
Other values (302) 2932
47.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
san 649
 
6.1%
jose 593
 
5.6%
sunnyvale 582
 
5.5%
palo 518
 
4.9%
alto 513
 
4.9%
santa 483
 
4.6%
clara 453
 
4.3%
cambridge 444
 
4.2%
west 333
 
3.1%
valley 323
 
3.1%
Other values (368) 5686
53.8%

Most occurring characters

ValueCountFrequency (%)
a 7095
 
11.0%
e 5522
 
8.5%
l 4913
 
7.6%
n 4890
 
7.6%
o 4646
 
7.2%
4445
 
6.9%
t 3469
 
5.4%
r 3368
 
5.2%
s 2456
 
3.8%
i 2317
 
3.6%
Other values (48) 21476
33.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 49200
76.2%
Uppercase Letter 10672
 
16.5%
Space Separator 4445
 
6.9%
Other Punctuation 188
 
0.3%
Dash Punctuation 76
 
0.1%
Decimal Number 16
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 7095
14.4%
e 5522
11.2%
l 4913
10.0%
n 4890
9.9%
o 4646
9.4%
t 3469
 
7.1%
r 3368
 
6.8%
s 2456
 
5.0%
i 2317
 
4.7%
u 1520
 
3.1%
Other values (17) 9004
18.3%
Uppercase Letter
ValueCountFrequency (%)
S 2013
18.9%
C 1536
14.4%
A 981
9.2%
P 851
8.0%
M 607
 
5.7%
W 598
 
5.6%
J 597
 
5.6%
V 572
 
5.4%
R 445
 
4.2%
G 371
 
3.5%
Other values (14) 2101
19.7%
Other Punctuation
ValueCountFrequency (%)
/ 155
82.4%
' 19
 
10.1%
. 14
 
7.4%
Decimal Number
ValueCountFrequency (%)
2 8
50.0%
4 8
50.0%
Space Separator
ValueCountFrequency (%)
4445
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 76
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 59872
92.7%
Common 4725
 
7.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 7095
 
11.9%
e 5522
 
9.2%
l 4913
 
8.2%
n 4890
 
8.2%
o 4646
 
7.8%
t 3469
 
5.8%
r 3368
 
5.6%
s 2456
 
4.1%
i 2317
 
3.9%
S 2013
 
3.4%
Other values (41) 19183
32.0%
Common
ValueCountFrequency (%)
4445
94.1%
/ 155
 
3.3%
- 76
 
1.6%
' 19
 
0.4%
. 14
 
0.3%
2 8
 
0.2%
4 8
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 64596
> 99.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 7095
 
11.0%
e 5522
 
8.5%
l 4913
 
7.6%
n 4890
 
7.6%
o 4646
 
7.2%
4445
 
6.9%
t 3469
 
5.4%
r 3368
 
5.2%
s 2456
 
3.8%
i 2317
 
3.6%
Other values (47) 21475
33.2%
None
ValueCountFrequency (%)
ğ 1
100.0%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
True
5287 
False
845 
ValueCountFrequency (%)
True 5287
86.2%
False 845
 
13.8%

neighbourhood
Categorical

Distinct187
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size48.0 KiB
San Jose
2166 
Palo Alto
588 
Santa Clara
442 
Sunnyvale
428 
Mountain View
423 
Other values (182)
2085 

Length

Max length24
Median length20
Mean length9.590835
Min length3

Characters and Unicode

Total characters58811
Distinct characters58
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)1.6%

Sample

1st rowMenlo Park
2nd rowMenlo Park
3rd rowMenlo Park
4th rowPalo Alto
5th rowSanta Clara

Common Values

ValueCountFrequency (%)
San Jose 2166
35.3%
Palo Alto 588
 
9.6%
Santa Clara 442
 
7.2%
Sunnyvale 428
 
7.0%
Mountain View 423
 
6.9%
San Francisco 374
 
6.1%
Los Gatos 142
 
2.3%
Menlo Park 137
 
2.2%
Cupertino 119
 
1.9%
Campbell 109
 
1.8%
Other values (177) 1204
19.6%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
san 2635
23.3%
jose 2214
19.6%
palo 705
 
6.2%
alto 705
 
6.2%
santa 447
 
4.0%
clara 442
 
3.9%
sunnyvale 432
 
3.8%
mountain 423
 
3.7%
view 423
 
3.7%
francisco 375
 
3.3%
Other values (202) 2510
22.2%

Most occurring characters

ValueCountFrequency (%)
a 7757
13.2%
n 5964
10.1%
o 5879
10.0%
5181
 
8.8%
e 4162
 
7.1%
S 3701
 
6.3%
s 3560
 
6.1%
l 3469
 
5.9%
t 2771
 
4.7%
J 2216
 
3.8%
Other values (48) 14151
24.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 42301
71.9%
Uppercase Letter 11295
 
19.2%
Space Separator 5181
 
8.8%
Decimal Number 16
 
< 0.1%
Dash Punctuation 13
 
< 0.1%
Other Letter 3
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 7757
18.3%
n 5964
14.1%
o 5879
13.9%
e 4162
9.8%
s 3560
8.4%
l 3469
8.2%
t 2771
 
6.6%
i 2060
 
4.9%
r 1693
 
4.0%
u 1074
 
2.5%
Other values (16) 3912
9.2%
Uppercase Letter
ValueCountFrequency (%)
S 3701
32.8%
J 2216
19.6%
P 899
 
8.0%
A 890
 
7.9%
C 813
 
7.2%
M 765
 
6.8%
V 474
 
4.2%
F 456
 
4.0%
L 270
 
2.4%
G 208
 
1.8%
Other values (14) 603
 
5.3%
Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Decimal Number
ValueCountFrequency (%)
2 8
50.0%
4 8
50.0%
Space Separator
ValueCountFrequency (%)
5181
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 53596
91.1%
Common 5212
 
8.9%
Han 3
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 7757
14.5%
n 5964
11.1%
o 5879
11.0%
e 4162
 
7.8%
S 3701
 
6.9%
s 3560
 
6.6%
l 3469
 
6.5%
t 2771
 
5.2%
J 2216
 
4.1%
i 2060
 
3.8%
Other values (40) 12057
22.5%
Common
ValueCountFrequency (%)
5181
99.4%
- 13
 
0.2%
2 8
 
0.2%
4 8
 
0.2%
, 2
 
< 0.1%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 58807
> 99.9%
CJK 3
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 7757
13.2%
n 5964
10.1%
o 5879
10.0%
5181
 
8.8%
e 4162
 
7.1%
S 3701
 
6.3%
s 3560
 
6.1%
l 3469
 
5.9%
t 2771
 
4.7%
J 2216
 
3.8%
Other values (44) 14147
24.1%
None
ValueCountFrequency (%)
ğ 1
100.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

city
Categorical

Distinct262
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size48.0 KiB
San Jose
1664 
San Francisco
554 
Palo Alto
489 
New York
443 
Santa Clara
349 
Other values (257)
2633 

Length

Max length20
Median length19
Mean length9.6263862
Min length3

Characters and Unicode

Total characters59029
Distinct characters57
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique129 ?
Unique (%)2.1%

Sample

1st rowMenlo Park
2nd rowSan Francisco
3rd rowMenlo Park
4th rowPalo Alto
5th rowMountain View

Common Values

ValueCountFrequency (%)
San Jose 1664
27.1%
San Francisco 554
 
9.0%
Palo Alto 489
 
8.0%
New York 443
 
7.2%
Santa Clara 349
 
5.7%
Sunnyvale 308
 
5.0%
Mountain View 268
 
4.4%
California 188
 
3.1%
Los Gatos 133
 
2.2%
Cupertino 110
 
1.8%
Other values (252) 1626
26.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
san 2362
21.2%
jose 1734
15.6%
francisco 555
 
5.0%
palo 531
 
4.8%
alto 531
 
4.8%
new 447
 
4.0%
york 443
 
4.0%
santa 359
 
3.2%
clara 349
 
3.1%
sunnyvale 316
 
2.8%
Other values (291) 3511
31.5%

Most occurring characters

ValueCountFrequency (%)
a 7206
12.2%
o 5766
 
9.8%
n 5595
 
9.5%
5006
 
8.5%
e 4327
 
7.3%
S 3295
 
5.6%
s 3284
 
5.6%
l 3112
 
5.3%
t 2546
 
4.3%
r 2500
 
4.2%
Other values (47) 16392
27.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 42847
72.6%
Uppercase Letter 11124
 
18.8%
Space Separator 5006
 
8.5%
Dash Punctuation 19
 
< 0.1%
Other Punctuation 17
 
< 0.1%
Decimal Number 16
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 7206
16.8%
o 5766
13.5%
n 5595
13.1%
e 4327
10.1%
s 3284
7.7%
l 3112
7.3%
t 2546
 
5.9%
r 2500
 
5.8%
i 2266
 
5.3%
c 1279
 
3.0%
Other values (16) 4966
11.6%
Uppercase Letter
ValueCountFrequency (%)
S 3295
29.6%
J 1737
15.6%
C 862
 
7.7%
A 730
 
6.6%
P 699
 
6.3%
F 653
 
5.9%
M 559
 
5.0%
N 493
 
4.4%
Y 449
 
4.0%
V 365
 
3.3%
Other values (15) 1282
 
11.5%
Other Punctuation
ValueCountFrequency (%)
' 16
94.1%
/ 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
4 8
50.0%
2 8
50.0%
Space Separator
ValueCountFrequency (%)
5006
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 53971
91.4%
Common 5058
 
8.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 7206
13.4%
o 5766
10.7%
n 5595
 
10.4%
e 4327
 
8.0%
S 3295
 
6.1%
s 3284
 
6.1%
l 3112
 
5.8%
t 2546
 
4.7%
r 2500
 
4.6%
i 2266
 
4.2%
Other values (41) 14074
26.1%
Common
ValueCountFrequency (%)
5006
99.0%
- 19
 
0.4%
' 16
 
0.3%
4 8
 
0.2%
2 8
 
0.2%
/ 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 59028
> 99.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 7206
12.2%
o 5766
 
9.8%
n 5595
 
9.5%
5006
 
8.5%
e 4327
 
7.3%
S 3295
 
5.6%
s 3284
 
5.6%
l 3112
 
5.3%
t 2546
 
4.3%
r 2500
 
4.2%
Other values (46) 16391
27.8%
None
ValueCountFrequency (%)
ğ 1
100.0%

zipcode
Real number (ℝ)

Distinct78
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean94770.345
Minimum93133
Maximum95976
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum93133
5-th percentile94036
Q194301
median95051
Q395125
95-th percentile95133
Maximum95976
Range2843
Interquartile range (IQR)824

Descriptive statistics

Standard deviation458.55583
Coefficient of variation (CV)0.0048386004
Kurtosis-1.2286191
Mean94770.345
Median Absolute Deviation (MAD)77
Skewness-0.7331783
Sum5.8113176 × 108
Variance210273.45
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
94301 337
 
5.5%
95112 309
 
5.0%
95128 287
 
4.7%
95051 240
 
3.9%
94303 215
 
3.5%
94306 197
 
3.2%
95125 179
 
2.9%
94043 169
 
2.8%
95127 167
 
2.7%
95126 163
 
2.7%
Other values (68) 3869
63.1%
ValueCountFrequency (%)
93133 6
 
0.1%
94022 88
1.4%
94024 46
 
0.8%
94025 131
2.1%
94027 6
 
0.1%
94028 28
 
0.5%
94035 1
 
< 0.1%
94036 74
1.2%
94039 67
1.1%
94040 116
1.9%
ValueCountFrequency (%)
95976 3
 
< 0.1%
95192 19
 
0.3%
95150 42
0.7%
95148 33
0.5%
95139 13
 
0.2%
95138 8
 
0.1%
95137 1
 
< 0.1%
95136 64
1.0%
95135 19
 
0.3%
95134 74
1.2%

property_type
Categorical

HIGH CARDINALITY
HIGH CORRELATION

Distinct63
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size48.0 KiB
Private room in home
1625 
Entire home
1149 
Entire rental unit
1060 
Entire guesthouse
342 
Entire guest suite
324 
Other values (58)
1632 

Length

Max length34
Median length31
Mean length17.773157
Min length3

Characters and Unicode

Total characters108985
Distinct characters37
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)0.2%

Sample

1st rowEntire guesthouse
2nd rowEntire rental unit
3rd rowPrivate room in condo
4th rowPrivate room
5th rowEntire rental unit

Common Values

ValueCountFrequency (%)
Private room in home 1625
26.5%
Entire home 1149
18.7%
Entire rental unit 1060
17.3%
Entire guesthouse 342
 
5.6%
Entire guest suite 324
 
5.3%
Entire serviced apartment 295
 
4.8%
Entire condo 168
 
2.7%
Private room in townhouse 148
 
2.4%
Private room in rental unit 142
 
2.3%
Entire townhouse 111
 
1.8%
Other values (53) 768
12.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
entire 3612
19.0%
home 2897
15.2%
room 2441
12.8%
in 2427
12.8%
private 2263
11.9%
rental 1213
 
6.4%
unit 1213
 
6.4%
guest 406
 
2.1%
suite 406
 
2.1%
guesthouse 368
 
1.9%
Other values (40) 1755
9.2%

Most occurring characters

ValueCountFrequency (%)
e 13034
12.0%
12869
11.8%
t 10618
9.7%
i 10408
9.5%
r 10266
9.4%
o 9439
8.7%
n 9413
8.6%
m 5692
 
5.2%
a 4564
 
4.2%
h 3714
 
3.4%
Other values (27) 18968
17.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 89908
82.5%
Space Separator 12869
 
11.8%
Uppercase Letter 6180
 
5.7%
Other Punctuation 28
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 13034
14.5%
t 10618
11.8%
i 10408
11.6%
r 10266
11.4%
o 9439
10.5%
n 9413
10.5%
m 5692
6.3%
a 4564
 
5.1%
h 3714
 
4.1%
u 3173
 
3.5%
Other values (13) 9587
10.7%
Uppercase Letter
ValueCountFrequency (%)
E 3613
58.5%
P 2263
36.6%
S 119
 
1.9%
R 84
 
1.4%
C 36
 
0.6%
T 25
 
0.4%
V 24
 
0.4%
F 8
 
0.1%
Y 3
 
< 0.1%
B 3
 
< 0.1%
Other values (2) 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
12869
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 28
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 96088
88.2%
Common 12897
 
11.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 13034
13.6%
t 10618
11.1%
i 10408
10.8%
r 10266
10.7%
o 9439
9.8%
n 9413
9.8%
m 5692
 
5.9%
a 4564
 
4.7%
h 3714
 
3.9%
E 3613
 
3.8%
Other values (25) 15327
16.0%
Common
ValueCountFrequency (%)
12869
99.8%
/ 28
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 108985
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 13034
12.0%
12869
11.8%
t 10618
9.7%
i 10408
9.5%
r 10266
9.4%
o 9439
8.7%
n 9413
8.6%
m 5692
 
5.2%
a 4564
 
4.2%
h 3714
 
3.4%
Other values (27) 18968
17.4%

room_type
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size48.0 KiB
Entire home/apt
3698 
Private room
2315 
Shared room
 
119

Length

Max length15
Median length15
Mean length13.789791
Min length11

Characters and Unicode

Total characters84559
Distinct characters17
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEntire home/apt
2nd rowEntire home/apt
3rd rowPrivate room
4th rowPrivate room
5th rowEntire home/apt

Common Values

ValueCountFrequency (%)
Entire home/apt 3698
60.3%
Private room 2315
37.8%
Shared room 119
 
1.9%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
entire 3698
30.2%
home/apt 3698
30.2%
room 2434
19.8%
private 2315
18.9%
shared 119
 
1.0%

Most occurring characters

ValueCountFrequency (%)
e 9830
11.6%
t 9711
11.5%
o 8566
10.1%
r 8566
10.1%
a 6132
 
7.3%
6132
 
7.3%
m 6132
 
7.3%
i 6013
 
7.1%
h 3817
 
4.5%
p 3698
 
4.4%
Other values (7) 15962
18.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 68597
81.1%
Space Separator 6132
 
7.3%
Uppercase Letter 6132
 
7.3%
Other Punctuation 3698
 
4.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 9830
14.3%
t 9711
14.2%
o 8566
12.5%
r 8566
12.5%
a 6132
8.9%
m 6132
8.9%
i 6013
8.8%
h 3817
 
5.6%
p 3698
 
5.4%
n 3698
 
5.4%
Other values (2) 2434
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
E 3698
60.3%
P 2315
37.8%
S 119
 
1.9%
Space Separator
ValueCountFrequency (%)
6132
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 3698
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 74729
88.4%
Common 9830
 
11.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 9830
13.2%
t 9711
13.0%
o 8566
11.5%
r 8566
11.5%
a 6132
8.2%
m 6132
8.2%
i 6013
8.0%
h 3817
 
5.1%
p 3698
 
4.9%
E 3698
 
4.9%
Other values (5) 8566
11.5%
Common
ValueCountFrequency (%)
6132
62.4%
/ 3698
37.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 84559
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 9830
11.6%
t 9711
11.5%
o 8566
10.1%
r 8566
10.1%
a 6132
 
7.3%
6132
 
7.3%
m 6132
 
7.3%
i 6013
 
7.1%
h 3817
 
4.5%
p 3698
 
4.4%
Other values (7) 15962
18.9%

accommodates
Real number (ℝ)

Distinct16
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.303816
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median2
Q34
95-th percentile8
Maximum16
Range15
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.45677
Coefficient of variation (CV)0.74361584
Kurtosis4.5573253
Mean3.303816
Median Absolute Deviation (MAD)1
Skewness1.8900864
Sum20259
Variance6.0357186
MonotonicityNot monotonic
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
2 2402
39.2%
1 1053
17.2%
4 964
15.7%
6 499
 
8.1%
3 441
 
7.2%
8 236
 
3.8%
5 217
 
3.5%
10 93
 
1.5%
7 83
 
1.4%
9 39
 
0.6%
Other values (6) 105
 
1.7%
ValueCountFrequency (%)
1 1053
17.2%
2 2402
39.2%
3 441
 
7.2%
4 964
15.7%
5 217
 
3.5%
6 499
 
8.1%
7 83
 
1.4%
8 236
 
3.8%
9 39
 
0.6%
10 93
 
1.5%
ValueCountFrequency (%)
16 25
 
0.4%
15 4
 
0.1%
14 17
 
0.3%
13 6
 
0.1%
12 35
 
0.6%
11 18
 
0.3%
10 93
 
1.5%
9 39
 
0.6%
8 236
3.8%
7 83
 
1.4%

bathrooms
Categorical

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size48.0 KiB
1
6132 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters6132
Distinct characters1
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 6132
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
1 6132
100.0%

Most occurring characters

ValueCountFrequency (%)
1 6132
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6132
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 6132
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6132
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 6132
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6132
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 6132
100.0%

bedrooms
Real number (ℝ)

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.5810502
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile4
Maximum8
Range7
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.0215999
Coefficient of variation (CV)0.6461527
Kurtosis4.5372599
Mean1.5810502
Median Absolute Deviation (MAD)0
Skewness2.0487448
Sum9695
Variance1.0436663
MonotonicityNot monotonic
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1 4153
67.7%
2 999
 
16.3%
3 558
 
9.1%
4 300
 
4.9%
5 81
 
1.3%
6 26
 
0.4%
7 11
 
0.2%
8 4
 
0.1%
ValueCountFrequency (%)
1 4153
67.7%
2 999
 
16.3%
3 558
 
9.1%
4 300
 
4.9%
5 81
 
1.3%
6 26
 
0.4%
7 11
 
0.2%
8 4
 
0.1%
ValueCountFrequency (%)
8 4
 
0.1%
7 11
 
0.2%
6 26
 
0.4%
5 81
 
1.3%
4 300
 
4.9%
3 558
 
9.1%
2 999
 
16.3%
1 4153
67.7%

beds
Real number (ℝ)

Distinct15
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.9274299
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile5
Maximum16
Range15
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.496296
Coefficient of variation (CV)0.7763167
Kurtosis10.039869
Mean1.9274299
Median Absolute Deviation (MAD)0
Skewness2.5923656
Sum11819
Variance2.2389017
MonotonicityNot monotonic
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
1 3460
56.4%
2 1275
 
20.8%
3 641
 
10.5%
4 378
 
6.2%
5 167
 
2.7%
6 103
 
1.7%
7 42
 
0.7%
8 29
 
0.5%
9 12
 
0.2%
10 9
 
0.1%
Other values (5) 16
 
0.3%
ValueCountFrequency (%)
1 3460
56.4%
2 1275
 
20.8%
3 641
 
10.5%
4 378
 
6.2%
5 167
 
2.7%
6 103
 
1.7%
7 42
 
0.7%
8 29
 
0.5%
9 12
 
0.2%
10 9
 
0.1%
ValueCountFrequency (%)
16 1
 
< 0.1%
14 3
 
< 0.1%
13 2
 
< 0.1%
12 3
 
< 0.1%
11 7
 
0.1%
10 9
 
0.1%
9 12
 
0.2%
8 29
 
0.5%
7 42
0.7%
6 103
1.7%

bed_type
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size48.0 KiB
Real Bed
5946 
Futon
 
94
Pull-out Sofa
 
39
Airbed
 
37
Couch
 
16

Length

Max length13
Median length8
Mean length7.9659165
Min length5

Characters and Unicode

Total characters48847
Distinct characters23
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowReal Bed
2nd rowReal Bed
3rd rowReal Bed
4th rowReal Bed
5th rowReal Bed

Common Values

ValueCountFrequency (%)
Real Bed 5946
97.0%
Futon 94
 
1.5%
Pull-out Sofa 39
 
0.6%
Airbed 37
 
0.6%
Couch 16
 
0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
real 5946
49.1%
bed 5946
49.1%
futon 94
 
0.8%
pull-out 39
 
0.3%
sofa 39
 
0.3%
airbed 37
 
0.3%
couch 16
 
0.1%

Most occurring characters

ValueCountFrequency (%)
e 11929
24.4%
l 6024
12.3%
a 5985
12.3%
5985
12.3%
d 5983
12.2%
R 5946
12.2%
B 5946
12.2%
o 188
 
0.4%
u 188
 
0.4%
t 133
 
0.3%
Other values (13) 540
 
1.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 30706
62.9%
Uppercase Letter 12117
 
24.8%
Space Separator 5985
 
12.3%
Dash Punctuation 39
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 11929
38.8%
l 6024
19.6%
a 5985
19.5%
d 5983
19.5%
o 188
 
0.6%
u 188
 
0.6%
t 133
 
0.4%
n 94
 
0.3%
f 39
 
0.1%
i 37
 
0.1%
Other values (4) 106
 
0.3%
Uppercase Letter
ValueCountFrequency (%)
R 5946
49.1%
B 5946
49.1%
F 94
 
0.8%
P 39
 
0.3%
S 39
 
0.3%
A 37
 
0.3%
C 16
 
0.1%
Space Separator
ValueCountFrequency (%)
5985
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 39
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 42823
87.7%
Common 6024
 
12.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 11929
27.9%
l 6024
14.1%
a 5985
14.0%
d 5983
14.0%
R 5946
13.9%
B 5946
13.9%
o 188
 
0.4%
u 188
 
0.4%
t 133
 
0.3%
n 94
 
0.2%
Other values (11) 407
 
1.0%
Common
ValueCountFrequency (%)
5985
99.4%
- 39
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 48847
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 11929
24.4%
l 6024
12.3%
a 5985
12.3%
5985
12.3%
d 5983
12.2%
R 5946
12.2%
B 5946
12.2%
o 188
 
0.4%
u 188
 
0.4%
t 133
 
0.3%
Other values (13) 540
 
1.1%

square_feet
Real number (ℝ)

Distinct38
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean719.18056
Minimum0
Maximum3700
Zeros7
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum0
5-th percentile718.17813
Q1718.17813
median718.17813
Q3718.17813
95-th percentile718.17813
Maximum3700
Range3700
Interquartile range (IQR)0

Descriptive statistics

Standard deviation71.009767
Coefficient of variation (CV)0.098737049
Kurtosis684.38973
Mean719.18056
Median Absolute Deviation (MAD)0
Skewness17.781965
Sum4410015.2
Variance5042.387
MonotonicityNot monotonic
Histogram with fixed size bins (bins=38)
ValueCountFrequency (%)
718.1781305 6053
98.7%
0 7
 
0.1%
900 7
 
0.1%
1000 6
 
0.1%
800 6
 
0.1%
850 5
 
0.1%
1200 4
 
0.1%
650 3
 
< 0.1%
700 3
 
< 0.1%
500 3
 
< 0.1%
Other values (28) 35
 
0.6%
ValueCountFrequency (%)
0 7
0.1%
16 1
 
< 0.1%
100 2
 
< 0.1%
125 1
 
< 0.1%
130 3
< 0.1%
193 1
 
< 0.1%
210 1
 
< 0.1%
290 1
 
< 0.1%
300 1
 
< 0.1%
350 2
 
< 0.1%
ValueCountFrequency (%)
3700 1
 
< 0.1%
2500 1
 
< 0.1%
2400 1
 
< 0.1%
2000 1
 
< 0.1%
1800 1
 
< 0.1%
1700 1
 
< 0.1%
1600 2
< 0.1%
1400 2
< 0.1%
1200 4
0.1%
1100 1
 
< 0.1%

security_deposit
Real number (ℝ)

Distinct74
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean177.21494
Minimum0
Maximum5100
Zeros9
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum0
5-th percentile100
Q1100
median100
Q3150
95-th percentile500
Maximum5100
Range5100
Interquartile range (IQR)50

Descriptive statistics

Standard deviation233.22778
Coefficient of variation (CV)1.3160729
Kurtosis136.27517
Mean177.21494
Median Absolute Deviation (MAD)0
Skewness9.1554526
Sum1086682
Variance54395.195
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100 4222
68.9%
200 404
 
6.6%
500 305
 
5.0%
150 254
 
4.1%
300 250
 
4.1%
250 246
 
4.0%
400 67
 
1.1%
95 64
 
1.0%
1000 61
 
1.0%
350 46
 
0.8%
Other values (64) 213
 
3.5%
ValueCountFrequency (%)
0 9
 
0.1%
95 64
 
1.0%
97 1
 
< 0.1%
98 1
 
< 0.1%
99 4
 
0.1%
100 4222
68.9%
105 1
 
< 0.1%
109 2
 
< 0.1%
110 4
 
0.1%
115 1
 
< 0.1%
ValueCountFrequency (%)
5100 2
 
< 0.1%
5000 1
 
< 0.1%
4000 1
 
< 0.1%
3000 4
 
0.1%
2600 1
 
< 0.1%
2500 2
 
< 0.1%
2000 16
0.3%
1500 11
0.2%
1200 1
 
< 0.1%
1020 1
 
< 0.1%

cleaning_fee
Real number (ℝ)

Distinct96
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.106491
Minimum0
Maximum600
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum0
5-th percentile10
Q130
median50
Q350
95-th percentile100
Maximum600
Range600
Interquartile range (IQR)20

Descriptive statistics

Standard deviation33.141176
Coefficient of variation (CV)0.66141483
Kurtosis35.311958
Mean50.106491
Median Absolute Deviation (MAD)11.5
Skewness3.5774933
Sum307253
Variance1098.3375
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50 2557
41.7%
20 377
 
6.1%
25 372
 
6.1%
30 288
 
4.7%
15 262
 
4.3%
10 257
 
4.2%
100 239
 
3.9%
75 227
 
3.7%
40 181
 
3.0%
60 170
 
2.8%
Other values (86) 1202
19.6%
ValueCountFrequency (%)
0 1
 
< 0.1%
5 93
 
1.5%
6 2
 
< 0.1%
7 11
 
0.2%
8 13
 
0.2%
9 7
 
0.1%
10 257
4.2%
11 2
 
< 0.1%
12 17
 
0.3%
13 4
 
0.1%
ValueCountFrequency (%)
600 2
 
< 0.1%
450 1
 
< 0.1%
400 1
 
< 0.1%
300 2
 
< 0.1%
275 2
 
< 0.1%
250 8
 
0.1%
220 1
 
< 0.1%
200 21
0.3%
199 2
 
< 0.1%
188 1
 
< 0.1%

guests_included
Real number (ℝ)

Distinct13
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.5114155
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile4
Maximum16
Range15
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.1092794
Coefficient of variation (CV)0.73393413
Kurtosis24.53533
Mean1.5114155
Median Absolute Deviation (MAD)0
Skewness3.8462254
Sum9268
Variance1.2305009
MonotonicityNot monotonic
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
1 4392
71.6%
2 1105
 
18.0%
4 294
 
4.8%
3 204
 
3.3%
6 52
 
0.8%
5 52
 
0.8%
8 12
 
0.2%
10 8
 
0.1%
7 7
 
0.1%
12 2
 
< 0.1%
Other values (3) 4
 
0.1%
ValueCountFrequency (%)
1 4392
71.6%
2 1105
 
18.0%
3 204
 
3.3%
4 294
 
4.8%
5 52
 
0.8%
6 52
 
0.8%
7 7
 
0.1%
8 12
 
0.2%
10 8
 
0.1%
11 1
 
< 0.1%
ValueCountFrequency (%)
16 2
 
< 0.1%
14 1
 
< 0.1%
12 2
 
< 0.1%
11 1
 
< 0.1%
10 8
 
0.1%
8 12
 
0.2%
7 7
 
0.1%
6 52
 
0.8%
5 52
 
0.8%
4 294
4.8%

extra_people
Real number (ℝ)

HIGH CORRELATION
ZEROS

Distinct61
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.943412
Minimum0
Maximum300
Zeros3095
Zeros (%)50.5%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q320
95-th percentile45
Maximum300
Range300
Interquartile range (IQR)20

Descriptive statistics

Standard deviation17.838468
Coefficient of variation (CV)1.4935823
Kurtosis36.995472
Mean11.943412
Median Absolute Deviation (MAD)0
Skewness3.9323798
Sum73237
Variance318.21095
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 3095
50.5%
20 612
 
10.0%
25 519
 
8.5%
10 518
 
8.4%
15 417
 
6.8%
30 242
 
3.9%
50 178
 
2.9%
35 83
 
1.4%
5 78
 
1.3%
40 77
 
1.3%
Other values (51) 313
 
5.1%
ValueCountFrequency (%)
0 3095
50.5%
5 78
 
1.3%
6 5
 
0.1%
7 22
 
0.4%
8 16
 
0.3%
9 9
 
0.1%
10 518
 
8.4%
11 10
 
0.2%
12 25
 
0.4%
13 5
 
0.1%
ValueCountFrequency (%)
300 2
 
< 0.1%
250 1
 
< 0.1%
200 1
 
< 0.1%
180 1
 
< 0.1%
179 1
 
< 0.1%
150 3
 
< 0.1%
130 1
 
< 0.1%
125 2
 
< 0.1%
120 1
 
< 0.1%
100 30
0.5%

availability_30
Real number (ℝ)

HIGH CORRELATION
ZEROS

Distinct31
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.323875
Minimum0
Maximum30
Zeros2239
Zeros (%)36.5%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median8
Q322
95-th percentile30
Maximum30
Range30
Interquartile range (IQR)22

Descriptive statistics

Standard deviation11.378576
Coefficient of variation (CV)1.0048306
Kurtosis-1.4009192
Mean11.323875
Median Absolute Deviation (MAD)8
Skewness0.41536213
Sum69438
Variance129.47199
MonotonicityNot monotonic
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
0 2239
36.5%
30 603
 
9.8%
18 178
 
2.9%
19 169
 
2.8%
29 163
 
2.7%
24 147
 
2.4%
26 127
 
2.1%
17 125
 
2.0%
1 124
 
2.0%
23 123
 
2.0%
Other values (21) 2134
34.8%
ValueCountFrequency (%)
0 2239
36.5%
1 124
 
2.0%
2 108
 
1.8%
3 98
 
1.6%
4 118
 
1.9%
5 107
 
1.7%
6 85
 
1.4%
7 101
 
1.6%
8 93
 
1.5%
9 107
 
1.7%
ValueCountFrequency (%)
30 603
9.8%
29 163
 
2.7%
28 112
 
1.8%
27 111
 
1.8%
26 127
 
2.1%
25 116
 
1.9%
24 147
 
2.4%
23 123
 
2.0%
22 82
 
1.3%
21 84
 
1.4%

availability_60
Real number (ℝ)

HIGH CORRELATION
ZEROS

Distinct61
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.677267
Minimum0
Maximum60
Zeros1721
Zeros (%)28.1%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median29
Q350
95-th percentile60
Maximum60
Range60
Interquartile range (IQR)50

Descriptive statistics

Standard deviation23.184531
Coefficient of variation (CV)0.83767415
Kurtosis-1.6005325
Mean27.677267
Median Absolute Deviation (MAD)25
Skewness0.034183765
Sum169717
Variance537.52248
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1721
28.1%
60 578
 
9.4%
48 169
 
2.8%
59 158
 
2.6%
54 123
 
2.0%
49 122
 
2.0%
53 117
 
1.9%
58 108
 
1.8%
47 107
 
1.7%
17 103
 
1.7%
Other values (51) 2826
46.1%
ValueCountFrequency (%)
0 1721
28.1%
1 76
 
1.2%
2 60
 
1.0%
3 42
 
0.7%
4 53
 
0.9%
5 42
 
0.7%
6 35
 
0.6%
7 41
 
0.7%
8 36
 
0.6%
9 40
 
0.7%
ValueCountFrequency (%)
60 578
9.4%
59 158
 
2.6%
58 108
 
1.8%
57 90
 
1.5%
56 99
 
1.6%
55 95
 
1.5%
54 123
 
2.0%
53 117
 
1.9%
52 63
 
1.0%
51 48
 
0.8%

availability_90
Real number (ℝ)

HIGH CORRELATION
ZEROS

Distinct91
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean46.520874
Minimum0
Maximum90
Zeros1351
Zeros (%)22.0%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q13
median54
Q379
95-th percentile90
Maximum90
Range90
Interquartile range (IQR)76

Descriptive statistics

Standard deviation34.544028
Coefficient of variation (CV)0.742549
Kurtosis-1.552252
Mean46.520874
Median Absolute Deviation (MAD)31
Skewness-0.21546932
Sum285266
Variance1193.2899
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1351
22.0%
90 505
 
8.2%
89 194
 
3.2%
78 158
 
2.6%
88 112
 
1.8%
84 111
 
1.8%
83 109
 
1.8%
79 107
 
1.7%
77 99
 
1.6%
1 95
 
1.5%
Other values (81) 3291
53.7%
ValueCountFrequency (%)
0 1351
22.0%
1 95
 
1.5%
2 53
 
0.9%
3 37
 
0.6%
4 40
 
0.7%
5 35
 
0.6%
6 29
 
0.5%
7 28
 
0.5%
8 31
 
0.5%
9 23
 
0.4%
ValueCountFrequency (%)
90 505
8.2%
89 194
 
3.2%
88 112
 
1.8%
87 86
 
1.4%
86 88
 
1.4%
85 86
 
1.4%
84 111
 
1.8%
83 109
 
1.8%
82 64
 
1.0%
81 52
 
0.8%

availability_365
Real number (ℝ)

HIGH CORRELATION
ZEROS

Distinct364
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean196.44423
Minimum0
Maximum365
Zeros744
Zeros (%)12.1%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q178
median180
Q3331
95-th percentile364
Maximum365
Range365
Interquartile range (IQR)253

Descriptive statistics

Standard deviation132.24188
Coefficient of variation (CV)0.67317772
Kurtosis-1.4865824
Mean196.44423
Median Absolute Deviation (MAD)134
Skewness-0.14969863
Sum1204596
Variance17487.914
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 744
 
12.1%
365 255
 
4.2%
364 189
 
3.1%
89 82
 
1.3%
363 69
 
1.1%
261 66
 
1.1%
179 63
 
1.0%
353 61
 
1.0%
352 60
 
1.0%
359 56
 
0.9%
Other values (354) 4487
73.2%
ValueCountFrequency (%)
0 744
12.1%
1 30
 
0.5%
2 17
 
0.3%
3 12
 
0.2%
4 18
 
0.3%
5 8
 
0.1%
6 8
 
0.1%
7 9
 
0.1%
8 10
 
0.2%
9 6
 
0.1%
ValueCountFrequency (%)
365 255
4.2%
364 189
3.1%
363 69
 
1.1%
362 37
 
0.6%
361 39
 
0.6%
360 45
 
0.7%
359 56
 
0.9%
358 54
 
0.9%
357 34
 
0.6%
356 25
 
0.4%

number_of_reviews
Real number (ℝ)

HIGH CORRELATION
ZEROS

Distinct327
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34.230594
Minimum0
Maximum795
Zeros1145
Zeros (%)18.7%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median8
Q337.25
95-th percentile157.45
Maximum795
Range795
Interquartile range (IQR)36.25

Descriptive statistics

Standard deviation64.202303
Coefficient of variation (CV)1.8755825
Kurtosis23.0212
Mean34.230594
Median Absolute Deviation (MAD)8
Skewness3.9644519
Sum209902
Variance4121.9357
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1145
 
18.7%
1 511
 
8.3%
2 323
 
5.3%
3 273
 
4.5%
4 224
 
3.7%
5 202
 
3.3%
6 146
 
2.4%
7 138
 
2.3%
8 112
 
1.8%
9 96
 
1.6%
Other values (317) 2962
48.3%
ValueCountFrequency (%)
0 1145
18.7%
1 511
8.3%
2 323
 
5.3%
3 273
 
4.5%
4 224
 
3.7%
5 202
 
3.3%
6 146
 
2.4%
7 138
 
2.3%
8 112
 
1.8%
9 96
 
1.6%
ValueCountFrequency (%)
795 1
< 0.1%
694 1
< 0.1%
690 1
< 0.1%
677 1
< 0.1%
644 1
< 0.1%
619 1
< 0.1%
591 1
< 0.1%
549 1
< 0.1%
537 1
< 0.1%
529 1
< 0.1%

review_scores_rating
Real number (ℝ)

Distinct126
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean94.473912
Minimum0
Maximum100
Zeros19
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum0
5-th percentile80
Q194.473912
median96
Q399.4
95-th percentile100
Maximum100
Range100
Interquartile range (IQR)4.9260878

Descriptive statistics

Standard deviation9.5176735
Coefficient of variation (CV)0.10074393
Kurtosis46.371829
Mean94.473912
Median Absolute Deviation (MAD)2.6
Skewness-5.9332765
Sum579314.03
Variance90.586109
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100 1380
22.5%
94.47391218 1145
18.7%
80 149
 
2.4%
93.4 137
 
2.2%
90 124
 
2.0%
96.6 123
 
2.0%
97.2 120
 
2.0%
97.8 114
 
1.9%
98.8 113
 
1.8%
95 113
 
1.8%
Other values (116) 2614
42.6%
ValueCountFrequency (%)
0 19
0.3%
20 24
0.4%
40 15
0.2%
46.6 1
 
< 0.1%
50 3
 
< 0.1%
53.4 2
 
< 0.1%
60 33
0.5%
65 1
 
< 0.1%
66.6 6
 
0.1%
68 1
 
< 0.1%
ValueCountFrequency (%)
100 1380
22.5%
99.8 35
 
0.6%
99.6 67
 
1.1%
99.4 75
 
1.2%
99.2 82
 
1.3%
99 102
 
1.7%
98.8 113
 
1.8%
98.6 104
 
1.7%
98.4 98
 
1.6%
98.2 95
 
1.5%

review_scores_accuracy
Real number (ℝ)

Distinct110
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.5866224
Minimum2
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum2
5-th percentile8.5
Q19.5866224
median9.72
Q310
95-th percentile10
Maximum10
Range8
Interquartile range (IQR)0.41337762

Descriptive statistics

Standard deviation0.69784356
Coefficient of variation (CV)0.072793475
Kurtosis45.444963
Mean9.5866224
Median Absolute Deviation (MAD)0.22
Skewness-5.5446787
Sum58785.168
Variance0.48698563
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 1581
25.8%
9.586622384 1164
19.0%
9.86 147
 
2.4%
9.84 132
 
2.2%
9.9 130
 
2.1%
9.88 125
 
2.0%
8 124
 
2.0%
9.76 120
 
2.0%
9.92 119
 
1.9%
9.94 119
 
1.9%
Other values (100) 2371
38.7%
ValueCountFrequency (%)
2 15
0.2%
3 1
 
< 0.1%
3.34 1
 
< 0.1%
4 9
 
0.1%
5 2
 
< 0.1%
6 26
0.4%
6.4 1
 
< 0.1%
6.5 1
 
< 0.1%
6.66 2
 
< 0.1%
7 27
0.4%
ValueCountFrequency (%)
10 1581
25.8%
9.98 62
 
1.0%
9.96 116
 
1.9%
9.94 119
 
1.9%
9.92 119
 
1.9%
9.9 130
 
2.1%
9.88 125
 
2.0%
9.86 147
 
2.4%
9.84 132
 
2.2%
9.82 113
 
1.8%

review_scores_cleanliness
Real number (ℝ)

Distinct132
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.4513929
Minimum2
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum2
5-th percentile8
Q19.4513929
median9.6
Q39.94
95-th percentile10
Maximum10
Range8
Interquartile range (IQR)0.48860709

Descriptive statistics

Standard deviation0.81056963
Coefficient of variation (CV)0.085761923
Kurtosis29.88549
Mean9.4513929
Median Absolute Deviation (MAD)0.28
Skewness-4.4855104
Sum57955.941
Variance0.65702312
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 1390
22.7%
9.451392914 1164
19.0%
8 161
 
2.6%
9.34 134
 
2.2%
9 134
 
2.2%
9.6 111
 
1.8%
9.76 109
 
1.8%
9.78 107
 
1.7%
9.92 107
 
1.7%
9.9 107
 
1.7%
Other values (122) 2608
42.5%
ValueCountFrequency (%)
2 19
0.3%
3 2
 
< 0.1%
4 14
 
0.2%
4.66 1
 
< 0.1%
5 3
 
< 0.1%
5.34 2
 
< 0.1%
6 46
0.8%
6.34 2
 
< 0.1%
6.5 1
 
< 0.1%
6.58 2
 
< 0.1%
ValueCountFrequency (%)
10 1390
22.7%
9.98 46
 
0.8%
9.96 72
 
1.2%
9.94 87
 
1.4%
9.92 107
 
1.7%
9.9 107
 
1.7%
9.88 102
 
1.7%
9.86 105
 
1.7%
9.84 96
 
1.6%
9.82 81
 
1.3%

review_scores_checkin
Real number (ℝ)

Distinct90
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.7342339
Minimum2
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum2
5-th percentile9
Q19.7342339
median9.88
Q310
95-th percentile10
Maximum10
Range8
Interquartile range (IQR)0.26576606

Descriptive statistics

Standard deviation0.61347279
Coefficient of variation (CV)0.063022195
Kurtosis69.890102
Mean9.7342339
Median Absolute Deviation (MAD)0.12
Skewness-7.2313206
Sum59690.323
Variance0.37634886
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 2122
34.6%
9.734233944 1165
19.0%
9.96 199
 
3.2%
9.92 193
 
3.1%
9.88 180
 
2.9%
9.9 166
 
2.7%
9.94 165
 
2.7%
9.86 128
 
2.1%
9.84 116
 
1.9%
9.82 108
 
1.8%
Other values (80) 1590
25.9%
ValueCountFrequency (%)
2 12
 
0.2%
4 11
 
0.2%
5 1
 
< 0.1%
5.34 1
 
< 0.1%
6 30
0.5%
6.66 1
 
< 0.1%
7 4
 
0.1%
7.2 2
 
< 0.1%
7.34 10
 
0.2%
7.38 1
 
< 0.1%
ValueCountFrequency (%)
10 2122
34.6%
9.98 95
 
1.5%
9.96 199
 
3.2%
9.94 165
 
2.7%
9.92 193
 
3.1%
9.9 166
 
2.7%
9.88 180
 
2.9%
9.86 128
 
2.1%
9.84 116
 
1.9%
9.82 108
 
1.8%
Distinct108
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.665781
Minimum2
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum2
5-th percentile8.66
Q19.665781
median9.84
Q310
95-th percentile10
Maximum10
Range8
Interquartile range (IQR)0.334219

Descriptive statistics

Standard deviation0.71430114
Coefficient of variation (CV)0.073899993
Kurtosis53.529026
Mean9.665781
Median Absolute Deviation (MAD)0.16
Skewness-6.3053925
Sum59270.569
Variance0.51022612
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 2045
33.3%
9.665780998 1164
19.0%
9.94 182
 
3.0%
9.9 167
 
2.7%
9.96 159
 
2.6%
9.88 151
 
2.5%
9.92 145
 
2.4%
9.84 116
 
1.9%
9.86 109
 
1.8%
9.8 105
 
1.7%
Other values (98) 1789
29.2%
ValueCountFrequency (%)
2 20
0.3%
3.34 1
 
< 0.1%
4 9
 
0.1%
4.66 2
 
< 0.1%
5 2
 
< 0.1%
6 27
0.4%
6.34 1
 
< 0.1%
6.4 1
 
< 0.1%
6.5 2
 
< 0.1%
6.66 4
 
0.1%
ValueCountFrequency (%)
10 2045
33.3%
9.98 86
 
1.4%
9.96 159
 
2.6%
9.94 182
 
3.0%
9.92 145
 
2.4%
9.9 167
 
2.7%
9.88 151
 
2.5%
9.86 109
 
1.8%
9.84 116
 
1.9%
9.82 84
 
1.4%

review_scores_location
Real number (ℝ)

Distinct98
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.6276465
Minimum2
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum2
5-th percentile8.8
Q19.6276465
median9.72
Q310
95-th percentile10
Maximum10
Range8
Interquartile range (IQR)0.37235353

Descriptive statistics

Standard deviation0.60700123
Coefficient of variation (CV)0.063047728
Kurtosis61.130946
Mean9.6276465
Median Absolute Deviation (MAD)0.2
Skewness-6.2812871
Sum59036.728
Variance0.3684505
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 1595
26.0%
9.627646466 1165
19.0%
9.9 139
 
2.3%
9.88 139
 
2.3%
9.86 139
 
2.3%
9.76 128
 
2.1%
9.84 127
 
2.1%
9.72 124
 
2.0%
9 117
 
1.9%
9.8 116
 
1.9%
Other values (88) 2343
38.2%
ValueCountFrequency (%)
2 12
0.2%
4 6
 
0.1%
4.66 1
 
< 0.1%
5 1
 
< 0.1%
5.34 1
 
< 0.1%
6 18
0.3%
6.4 1
 
< 0.1%
6.66 2
 
< 0.1%
7 10
0.2%
7.34 9
0.1%
ValueCountFrequency (%)
10 1595
26.0%
9.98 35
 
0.6%
9.96 97
 
1.6%
9.94 93
 
1.5%
9.92 105
 
1.7%
9.9 139
 
2.3%
9.88 139
 
2.3%
9.86 139
 
2.3%
9.84 127
 
2.1%
9.82 107
 
1.7%

review_scores_value
Real number (ℝ)

Distinct121
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.3638776
Minimum2
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size48.0 KiB

Quantile statistics

Minimum2
5-th percentile8
Q19.34
median9.46
Q39.8
95-th percentile10
Maximum10
Range8
Interquartile range (IQR)0.46

Descriptive statistics

Standard deviation0.82308958
Coefficient of variation (CV)0.087900506
Kurtosis30.672026
Mean9.3638776
Median Absolute Deviation (MAD)0.26
Skewness-4.5500962
Sum57419.297
Variance0.67747645
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9.363877592 1165
19.0%
10 1033
 
16.8%
8 191
 
3.1%
9 167
 
2.7%
9.34 151
 
2.5%
9.6 136
 
2.2%
9.66 131
 
2.1%
9.5 128
 
2.1%
9.72 124
 
2.0%
9.76 117
 
1.9%
Other values (111) 2789
45.5%
ValueCountFrequency (%)
2 24
0.4%
3 2
 
< 0.1%
4 10
 
0.2%
5 4
 
0.1%
5.34 1
 
< 0.1%
6 59
1.0%
6.5 5
 
0.1%
6.58 1
 
< 0.1%
6.66 8
 
0.1%
7 23
 
0.4%
ValueCountFrequency (%)
10 1033
16.8%
9.98 1
 
< 0.1%
9.96 13
 
0.2%
9.94 37
 
0.6%
9.92 44
 
0.7%
9.9 61
 
1.0%
9.88 74
 
1.2%
9.86 87
 
1.4%
9.84 74
 
1.2%
9.82 85
 
1.4%
Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size48.0 KiB
strict
2664 
flexible
2025 
moderate
1442 
no_refunds
 
1

Length

Max length10
Median length8
Mean length7.1314416
Min length6

Characters and Unicode

Total characters43730
Distinct characters17
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowmoderate
2nd rowflexible
3rd rowstrict
4th rowstrict
5th rowmoderate

Common Values

ValueCountFrequency (%)
strict 2664
43.4%
flexible 2025
33.0%
moderate 1442
23.5%
no_refunds 1
 
< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
strict 2664
43.4%
flexible 2025
33.0%
moderate 1442
23.5%
no_refunds 1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
e 6935
15.9%
t 6770
15.5%
i 4689
10.7%
r 4107
9.4%
l 4050
9.3%
s 2665
 
6.1%
c 2664
 
6.1%
f 2026
 
4.6%
b 2025
 
4.6%
x 2025
 
4.6%
Other values (7) 5774
13.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 43729
> 99.9%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 6935
15.9%
t 6770
15.5%
i 4689
10.7%
r 4107
9.4%
l 4050
9.3%
s 2665
 
6.1%
c 2664
 
6.1%
f 2026
 
4.6%
b 2025
 
4.6%
x 2025
 
4.6%
Other values (6) 5773
13.2%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 43729
> 99.9%
Common 1
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 6935
15.9%
t 6770
15.5%
i 4689
10.7%
r 4107
9.4%
l 4050
9.3%
s 2665
 
6.1%
c 2664
 
6.1%
f 2026
 
4.6%
b 2025
 
4.6%
x 2025
 
4.6%
Other values (6) 5773
13.2%
Common
ValueCountFrequency (%)
_ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 43730
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 6935
15.9%
t 6770
15.5%
i 4689
10.7%
r 4107
9.4%
l 4050
9.3%
s 2665
 
6.1%
c 2664
 
6.1%
f 2026
 
4.6%
b 2025
 
4.6%
x 2025
 
4.6%
Other values (7) 5774
13.2%

Interactions

Correlations

Auto

The auto setting is an interpretable pairwise column metric of the following mapping:
  • Variable_type-Variable_type : Method, Range
  • Categorical-Categorical : Cramer's V, [0,1]
  • Numerical-Categorical : Cramer's V, [0,1] (using a discretized numerical column)
  • Numerical-Numerical : Spearman's ρ, [-1,1]
The number of bins used in the discretization for the Numerical-Categorical column pair can be changed using config.correlations["auto"].n_bins. The number of bins affects the granularity of the association you wish to measure.

This configuration uses the recommended metric for each pair of columns.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

host_neighbourhoodhost_identity_verifiedneighbourhoodcityzipcodeproperty_typeroom_typeaccommodatesbathroomsbedroomsbedsbed_typesquare_feetsecurity_depositcleaning_feeguests_includedextra_peopleavailability_30availability_60availability_90availability_365number_of_reviewsreview_scores_ratingreview_scores_accuracyreview_scores_cleanlinessreview_scores_checkinreview_scores_communicationreview_scores_locationreview_scores_valuecancellation_policy
0Menlo ParktMenlo ParkMenlo Park94301Entire guesthouseEntire home/apt1111Real Bed718.1781310050120111119100.00000010.00000010.0000009.9000009.90000010.0000009.780000moderate
1Menlo ParktMenlo ParkSan Francisco94025Entire rental unitEntire home/apt2111Real Bed718.178131005010222591090.0000009.6000008.4000009.8000009.8000009.6000008.200000flexible
2Menlo ParktMenlo ParkMenlo Park94025Private room in condoPrivate room1121Real Bed718.1781380010042511111111094.4739129.5866229.4513939.7342349.6657819.6276469.363878strict
3Palo AltotPalo AltoPalo Alto94301Private roomPrivate room1111Real Bed718.178131007510000587695.8000009.6200009.7600009.7200009.6600009.7800009.560000strict
4Santa ClarafSanta ClaraMountain View95051Entire rental unitEntire home/apt2111Real Bed718.178131001721500151051691.2000009.7600009.6200009.8800009.7600009.6200009.380000moderate
5Palo AltotPalo AltoPalo Alto94301Private roomPrivate room1111Real Bed718.178135003510001028326497.4000009.6400009.5800009.7000009.8000009.9200009.480000moderate
6Palo AltotPalo AltoPalo Alto94303Private room in homePrivate room2111Real Bed718.178131003021000026114098.2000009.9000009.9000009.8200009.9000009.9000009.660000flexible
7Palo AltotPalo AltoPalo Alto94301Private roomPrivate room2114Real Bed718.17813100201918282823714997.8000009.7800009.6600009.7000009.8200009.9200009.560000strict
8Palo AltotPalo AltoPalo Alto94301Private roomPrivate room1111Real Bed718.178131004031000227720897.2000009.7400009.8200009.8200009.8400009.9000009.540000moderate
9CupertinotCupertinoCupertino95129Entire guesthouseEntire home/apt4111Real Bed718.17813150504252326233718398.6000009.8400009.9600009.9200009.9000009.7800009.780000strict
host_neighbourhoodhost_identity_verifiedneighbourhoodcityzipcodeproperty_typeroom_typeaccommodatesbathroomsbedroomsbedsbed_typesquare_feetsecurity_depositcleaning_feeguests_includedextra_peopleavailability_30availability_60availability_90availability_365number_of_reviewsreview_scores_ratingreview_scores_accuracyreview_scores_cleanlinessreview_scores_checkinreview_scores_communicationreview_scores_locationreview_scores_valuecancellation_policy
6122East Palo AltotEast Palo AltoUnion City94301Private room in homePrivate room2111Real Bed718.1781310050109316124119100.00000010.0000009.90000010.00000010.0000009.90000010.000000flexible
6123East Palo AltotEast Palo AltoPleasanton94303Private room in villaPrivate room2111Real Bed718.17813100510306090901100.00000010.00000010.00000010.00000010.00000010.00000010.000000flexible
6124East Palo AltotEast Palo AltoEast Palo Alto94303Private room in homePrivate room2111Real Bed718.1781315065510001913495.8000009.8800009.6400009.7000009.8800008.8800009.640000strict
6125Menlo ParktMenlo ParkMenlo Park94025Private room in homePrivate room2111Real Bed718.1781310050101343733483199.4000009.8600009.8600009.9400009.9400009.8000009.580000flexible
6126East Palo AltotEast Palo AltoPalo Alto94303Entire guest suiteEntire home/apt4122Real Bed718.1781310010120018483232993.8000009.5800009.3800009.8000009.7200009.1800008.900000strict
6127Menlo ParktMenlo ParkMenlo Park94025Entire homeEntire home/apt5134Real Bed718.17813100510333177094.4739129.5866229.4513939.7342349.6657819.6276469.363878strict
6128Downtown NorthtMenlo ParkMenlo Park94025Private room in rental unitPrivate room2111Real Bed718.17813100501400434309094.4739129.5866229.4513939.7342349.6657819.6276469.363878flexible
6129Los GatostLos GatosLos Gatos95033Entire guest suiteEntire home/apt4122Real Bed718.178131002510174473735887.2000009.2400008.8000009.2400009.2800009.4200009.060000strict
6130SunnyvaletSan FranciscoSan Francisco94025Entire serviced apartmentEntire home/apt2111Real Bed718.178132005010113278592.00000010.0000009.20000010.0000008.80000010.0000008.000000strict
6131East Palo AltotEast Palo AltoEast Palo Alto94301Entire guest suiteEntire home/apt6133Real Bed718.17813100501151848781682494.2000009.7600009.6600009.7600009.7600008.8400009.260000moderate

Duplicate rows

Most frequently occurring

host_neighbourhoodhost_identity_verifiedneighbourhoodcityzipcodeproperty_typeroom_typeaccommodatesbathroomsbedroomsbedsbed_typesquare_feetsecurity_depositcleaning_feeguests_includedextra_peopleavailability_30availability_60availability_90availability_365number_of_reviewsreview_scores_ratingreview_scores_accuracyreview_scores_cleanlinessreview_scores_checkinreview_scores_communicationreview_scores_locationreview_scores_valuecancellation_policy# duplicates
0CambridgetMenlo ParkNew York94028Entire rental unitEntire home/apt2111Real Bed718.1781310050100000094.4739129.5866229.4513939.7342349.6657819.6276469.363878flexible2
1CambridgetSan JoseNew York95129Entire serviced apartmentEntire home/apt3112Real Bed718.178131005010306090365094.4739129.5866229.4513939.7342349.6657819.6276469.363878flexible2
2OrangetPalo AltoSan Francisco94306Room in boutique hotelPrivate room2111Real Bed718.17813100501043464339094.4739129.5866229.4513939.7342349.6657819.6276469.363878strict2